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In the paper we compare the modelling ability of discrete-time multi- 
variate Stochastic Volatility models to describe the conditional correlations 
between stock index returns. We consider four trivariate SV models, which 
differ in the structure of the conditional covariance matrix. Specifications 
with zero, constant and time- varying conditional correlations are taken into 
account. As an example we study trivariate volatility models for the daily 
log returns on the WIG, S&P 500, and FTSE 100 indexes. In order to 
formally compare the relative explanatory power of SV specifications we 
use the Bayesian principles of comparing statistic models. Our results are 
based on the Bayes factors and implemented through Markov Chain Monte 
Carlo techniques. The results indicate that the most adequate specifica- 
tions are those that allow for time- varying conditional correlations and that 
have as many latent processes as there are conditional variances and co- 
variances. The empirical results clearly show that the data strongly reject 
the assumption of constant conditional correlations. 

PACS numbers: 89.65 Gh, 05.10 Gg 



1. Introduction 

There are a lot of theoretical and empirical reasons to study multivari- 
ate volatility models. Analysis of financial market volatility and correla- 
tions among markets play a crucial role in financial decision making (e.g. 
hedging strategies, portfolio allocations, Value-at-Risk calculations). The 
correlations among markets are very important in the global portfolio di- 
versification. 
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The main aim of the paper is to compare the modelling ability of discrete- 
time Multivariate Stochastic Volatility (MSV) models to describe the condi- 
tional correlations and volatilities of stock index returns. The MSV models 
offer powerful alternatives to multivariate GARCH models in accounting 
for properties of the conditional variances and correlations. Superior per- 
formance of bivariate SV models over GARCH models (in term of the Bayes 
factor) are documented in 8 . But the MSV models are not as often used 
in empirical applications as the GARCH models. The main reason is that 
the SV models are more difficult to estimate. In this paper we consider four 
multivariate Stochastic Volatility models, including the specification with 
zero, constant and time-varying conditional correlations. These MSV speci- 
fications are used to model volatilities and conditional correlations between 
stock index returns. We study trivariate volatility models for the daily log 
returns on the WIG index, the Standard & Poor's 500 index, and the FTSE 
100 index for the period January 4, 1999 to December 30, 2005. In the next 
section the Bayesian statistical methodology is briefly presented . In section 
3 the model framework is introduced. Section 4 is devoted to the descrip- 
tion of trivariate SV specifications. In section 5 we present and discuss the 
empirical results. 



2. Bayesian statistical methodology 

Let y be the observation matrix and 9i be the vector of unknown pa- 
rameters and LOi the latent variable vector in model Mi (i = 1,2, ... , n). 
The i - the Bayesian model is characterized by the joint probability density 
function, which can be written as the product of three densities: 

p(y,Ui,9i\y(p),Mi) = p{y\u)i,0i,y(a), Mi)p(u>i\0i, Mi)p(9i\Mi), i = 1,2,... ,n, 

where yr ) denotes initial conditions, p(y\u>i, 9i, yr \, Mi) is the conditional 
density of y when u>i £ &i,0i G @i are given, p(uji\9i, Mi) is the density of 
the latent variables conditioned on 6i, p(9i\Mi) is the prior density function 
under Mi. The joint probability density function can be expressed as the 
product of the marginal data density of the observation matrix (given the 
initial conditions y( )) in model M^. p(y\y^, Mi), and the posterior density 
function of the parameter vector 9{ and the latent variable vector cjj in Mf. 
p(uJi,9i\y,y {0) ,Mi), i.e. 

P (y , w% , 0i | ?/(o) , Mi ) = p( toi , 0i | y , 2/(0) , M { )p(y \ y (0) , Mj ) , 

where 

p(y\y(p),Mi) = / p(y\uii,9i,y^,Mi)p(tOi,9i\Mi)duJid9i. 
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The statistical inference is based on the posterior distributions, while the 
marginal densities p(y|y(o), Mi) (i = 1,2, ... ,ri) are the crucial components 
in model comparison. 

Assume that M±, . . . ,M n are mutually exclusive (non- nested) and jointly 
exhaustive models. From Bayes's theorem, it is easy to show that the pos- 
terior probability of Mi is given by: 

where p(Mi) denotes the prior probability of Mj. For the sake of pairwise 
comparison, we use the posterior odds ratio, which for any two models Mi 
and Mj is equal to the prior odds ratio times the ratio of the marginal data 
densities * 

p{Mj\y,y i0) ) = pjMj) p(y\y(o),Mj) 
p(Mj|y,y (0 )) ~ p(Mj) ' p(y\y (0) , Mj) ' 
The ratio of the marginal data densities is called the Bayes factor: 

p(y\y(o),Mi) 



p(y\y(o), M j 



-i 



Thus, assuming equal prior model probabilities (i.e. p{Mi) = p(Mj)), the 
Bayes factor is equal to the posterior odds ratio. We see that the values 
of the marginal data densities for each model are the main quantities for 
Bayesian model comparison. The marginal data density in model Mj can 
be written as: 

p(y\y(o),Mi) = ( [ \p(y\vi,Oi,y(o),Mi)] p^e^y^^^^dwidQ, 

Of course, in the case of SV models this integral can not be evaluated 
analytically and thus must be computed by numerical methods. We use 
the method proposed by jS], which approximates the marginal data density 
by the harmonic mean of the values p(y\uJi,6i,y^, Mi), calculated for the 

observed matrix y and for the vector (wj , 0^)' drawn from the posterior 
distribution. That is: 

-l 

(1 "' 1 
_ E , (a) (a) 
m Up(yH 9) 4 q \y(o),Mi) 



The estimator p(y\y(o), Mi) is very easy to calculate and gives results that 
are precise enough for our model comparison. 
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3. Model framework 

Let Xjt denote the price of asset j (or index quotations as in our applica- 
tion) at time t for j = 1,2,3 and t = 1, 2, . . . , T. The vector of growth rates 
Vt = (yi,t,y2,t,y3,t)', each defined by the formula y jtt = 100 In (xtj/xj^-i), 
is modelled using the VAR(l) framework: 

Ut ~ 5 = R(y t -i - 6) + t = l,2,...,T, 



where is a trivariate SV process, T denotes the number of the observa- 
tions used in estimation. More specifically: 
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We assume that, conditionally on the latent variable vector and on the 
parameter vector 6i,£,t follows a trivariate Gaussian distribution with mean 
vector 0[ 3xl ] and covariance matrix Ej, i.e. 

et|nt(i),ei~JV(0 [3 xi],St), t = l,2,...,T. 

Competing trivariate SV models are defined by imposing different structures 
on E t . 

For all elements of 5 and R we assume the multivariate standardized Normal 
prior Af(0, 115), truncated by the restriction that all eigenvalues of R lie 
inside the unit circle. We assume that the matrix [5, R] and the remaining 
(model-specific) parameters are prior independent. 

4. Trivariate VAR(l) - SV models 

4-1- Stochastic Discount Factor Model (SDF) 

The first specification considered here is the stochastic discount factor 
model (SDF) proposed, but not applied, by [3]. The SDF process is defined 
as follows: 

it = e t \fh~ t , e t ~ MiV(0[3 Xl ],£), 
In h t = <p In h t -i + a h r\ u rj t ~ UN(0, 1), 

£j,t -L rjs, t,s e Z, j = 1,2,3, 

where Z = {. . . , —2, —1, 0, 1, 2, . . .}, _L denotes independence, and the sym- 
bol rjt ~ UN(0, 1) denotes a series of independently and normally distributed 
random variables with mean vector 0[3 X i] and covariance matrix S. In this 
case, we have 

61^(1), £ ~ (0[3xl]> ht'E), 
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where Sim) = ht- The conditional covariance matrix of £t is time varying 
and stochastic, but all its elements have the same dynamics governed by 
hf. Consequently, the conditional correlation coefficients are constant over 
time. Our model specification is completed by assuming the following prior 
structure: 

P{4>, °hM h o, S) = p(4>)p(al)p(lnh )p{^), 

where we use proper prior densities of the following distributions: 
0~ N '(0,100)I(_i !)(<£), a\ ~IG(1, 0.005), ln/i ~ N(0, 100), 
E~ IW(3I,3,3).' 

The symbol N(a, b) denotes the normal distribution with mean a and vari- 
ance b, I(_i i)(.) is the indicator function of the interval (—1, 1). IG{v$, sq) 
denotes the inverse Gamma distribution with mean sq/^q — 1) and variance 
Sq/[(i^q — 1) 2 (uq — 2)]. The symbol IW(B, d, 3) denotes the three-dimensional 
inverse Wishart distribution with d degrees of freedom and parameter ma- 
trix B. The initial condition for In ht (i.e. ln/io) is treated as an additional 
parameter and estimated jointly with other parameters. 

4-2. Basic Stochastic Volatility Model (BSV) 

Next, we consider the basic stochastic volatility process (BSV), where 
&|fi t (2) ~ N(0[ 3>< i], E t ), and E t = Diag(hi jt ,h2 tt ,h 3}t ) (similar to the idea 
of [2]). The conditional variance equations are: 

\nh jit - Jjj = Ojtilnh,,, | - + o-jjVjt, 

for j = 1,2,3, where T] t ~ UN(0 [3xl] , I 3 ), rj t = (m,t,m,t,m,t)' &t(2) = 
(hl,t, h-2,t, h-sj)' ■ For the parameters we use the same specification of prior 
distribution as in the univariate SV model (see 0), i.e. {ijj^jj)' ~ 
N(0,W0I)I { -i,i)(<Pjj), IG(1,0.005), In ^-0-^(0, 100), i = 1,2,3. 

4.3. J SV Model 

Both previous specifications (SDF and BSV) are very restrictive. Now, 
we propose a SV process based on the spectral decomposition of the matrix 
Sj. That is 

S t = PA t P-\ 

where At is the diagonal matrix consisting of all eigenvalues of 5^, and 
P is the matrix consisting of the eigenvectors of S^. For series {lnA^j} 
(j = 1,2,3), similarly as in the univariate SV process, we assume standard 
univariate autoregressive processes of order one, namely 



In \ j:t - n/jj = <.< ;; (ln Xj,t-i - Jjj) + o-j^ij.,. 



G 
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for j = 1,2,3 , where rj t ~ niV(0[ 3xl j, I 3 ), rj t = (m,t, m,t, m,t)', and O t(3) = 
(Ai,t, ^2,tj ^3,*)'- This reparametrization of S t does not require any parame- 
ter constraints to ensure positive definiteness of S t . If |^| < 1 (j = 1, 2, 3) 
then {In Ai^}, {In \2i,t} and {In A 3) t} are stationary and the JSV process is a 
white noise. In addition, P is an orthogonal matrix, i.e. P'P = I2, thus P is 
parametrized by three parameters (Euler angles) Kj £ (— it, n), j G {1, 2, 3}: 



P(«l, K 2 , K 3 ) = Pi{ki)P2{k 2 )P2,{kz), 



where for Z = 1 , 3 
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In this case the conditional correlation coefficients are time-varying and 
stochastic if kj 7^ for some j 6 {1,2,3}. For the model-specific parameters 
we take the following prior distributions: {"fjj, 4>jj)' ~ N(0, 100J)Jr_i x) (<^7'j)> 
~ IG(1, 0.005), lnA ji0 ~ iV(0, 100), k 3 - ~ C/(-vr,vr) (i.e. uniform over 
(— vr,7r)), j = 1,2,3. The BSV model can be obtained by imposing the pa- 
rameter restrictions K\ = K2 = K3 = in the P definition of the JSV model 
(but we formally exclude this value). 



44. TSV Model 

The next specification (proposed by ^F3\, thus called TSV) uses six sepa- 
rate latent processes (the number of the latent processes is now equal to the 
number of distinct elements of the conditional covariance matrix) . Following 
the definition in ^3], we propose to use the Cholesky decomposition: 

= L t GtL' t , 

where Lt is a lower triangular matrix with unitary diagonal elements, Gt is 
a diagonal matrix with positive diagonal elements: 
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Q2l,t 





1 

Q32,t 
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Q22,t 








Q33,t 



that is 



Qli.t Qll,tl2l,t 

2 

Q2l,tQll,t ?ll,*921,t + Q22,t 

Q31,tQll,t Qll,tl21,tQ31,t + Q22,tQ32,t 



9il,t93l,t 

Qll,tl21,tQ31,t + Q22,tQ32,t 
2 2 
9H,t?31.t + 922,i932,i + <?33,t 
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Series {qij.t}, and {Inqjjj} = 1,2,3, i > j), analogous to the uni- 
variate SV, are standard univariate autoregressive processes of order one, 
namely 

In q jj>t - 7ij - = (f>jj (In q jj>t _ , - 7ji ) + a 5j //^./. j = 1 , 2 , 3 , 
qij,t ~ lij = 4>ij(qij,t-i ~ Jij) + °ijVij,t, j, i S {1, 2, 3}, i > j, 

m = (vn,t,m2,t,v33,t,v2i,t,mi,t,V32,t)' ~ HiY 6 (0[ 6x i],/ 6 ), t g z, 

^t(4) = Q22,U <?33,t> q21,t, q31,t, <732,t) • 

Note that positive definiteness of St is achieved by modelling In qjjj instead 
of qjjt ■ It is easy to show that if the absolute values of <pij are less than 
one the TS V process is a white noise (see ^0] ) ■ We see that the TSV model 
is able to model both the time-varying conditional correlation coefficients 
and variances of returns. A major drawback of this process is that the 
conditional variances and covariances are not modelled in a symmetric way, 
thus the explanatory power of model may depend on the ordering of financial 
instruments. 

We assume the following prior distributions: 

(jij^ij)' ~ iV(O,lOOJ)/ ( _ 1)1) (0 ii ), 4 ~ JG(1, 0.005), Ing« )0 ~ N(0, 100) 
for i,j G {1,2,3} and i > j ; q ijfi ~ JV(0, 100) for i,j G {1,2,3}, i > j. The 
prior distributions used are relatively noninformative. Note that the BSV 
model can be obtained as a limiting case, corresponding to = <pij = 0, 
a\ -> for i,j G {1,2,3}, i>j. 

5. Empirical results 

We consider daily stock index returns for three national markets: Poland 
(WIG), the United States (S&P 500), and the United Kingdom (FTSE 100), 
from January 4, 1999 to December 30, 2005. We consider only index clos- 
ing quotations in trading days for all considered national markets, thus our 
sample consists of 1701 daily observations . The first observation is used to 
construct initial conditions. Thus T, the length of the modelled vector time 
series, is equal to 1700. In Table 1 we present the decimal logarithms of the 
Bayes factors in favor of TSVfsw model. Our posterior results are obtained 
using MCMC methods: Metropolis-Hastings within the Gibbs sampler (see 
[TT] . [7j and 0). The results presented in this paper are based on 500,000 
states of the Markov chain, generated after 100, 000 burnt-in states. The 

1 The data were downloaded from the websites (http://finance.yahoo.com) and 
http://www.parkiet.com/dane/dane_atxt.jsp where complete descriptions of the 
indices can be found. 
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Table 1. Logs of Bayes factors in favour of TSVfsw model 



Model 


Number of latent 


Number of 


log 10 (-B4,l,i) 


Rank 




processes 


parameters 






M 4 ,i (TSV F sw) 


6 


12+24 


0.00 


1 


M 4 , 2 (TSV FW s) 


6 


12+24 


7.82 


2 


M 4 , 3 (TSVswf) 


6 


12+24 


15.55 


3 


M 4 , 4 (TSVsfw) 


6 


12+24 


15.86 


4 


M 4 , 5 (TSVwfs) 


6 


12+24 


17.05 


5 


M 4i6 (TSV W sf) 


6 


12+24 


22.96 


6 


M 3 (JSV) 


3 


12+15 


63.68 


7 


Mi (SDF) 


1 


12+9 


87.39 


8 


M 2 (BSV) 


3 


12+12 


181.18 


9 



Bayes factors are calculated using the Newton and Raftery's method |S] • Be- 
cause in the TSV specification the conditional variances are not modelled 
in a symmetric way, we consider six cases: TSVfsw, TSVfws, TSVswf, 
TSVsfw, TSVwfs, and TSVwsf- These models differ in ordering of el- 
ements in y t . For example in the TSVfsw model yij denotes the daily 
growth rate of the FTSE 100 index at day t, and i/2,i and y^^ are respec- 
tively the daily growth rates of the S&P 500 and the WIG indexes at day t. 
Our findings show clear superiority of the TSV specifications (which de- 
scribe the six distinct elements of the conditional covariance matrix by six 
separate latent processes) over all SV models considered here. The TSVfsw 
model receives almost all posterior probability mass (assuming equal prior 
model probabilities), being about 7.82 orders of magnitude more probable 
a posterior than the TSVfws model and 63.68 orders of magnitude better 
than the JSV model. Furthermore, the TSVwsf model fits the data about 
23 orders of magnitude worse than the best TSV model. It is mainly at- 
tributed to the fact that the growth rates of the FTSE index are less volatile 
than the S&P and WIG indexes. When we compare the unconditional vari- 
ance of £j t t (Var^jj) = exp (7^ + 0. 5(7^/(1 — , j = 1, 2, 3) obtained 
in the BSV model, we observe a value of 1.448 for the WIG index, 0.955 
for the S&P 500 index and 0.943 for the FTSE index. It is in accordance 
with the ordering of returns in the best TSV model. Thus, the explanatory 
power of the SV model depends not only on the number of latent processes, 
but also on the ordering of financial instruments in case of the TSV speci- 
fications. The results indicate that the return rates of the WIG, S&P and 
FTSE indexes reject the constant or zero conditional correlation hypothesis, 
represented by the SDF and BSV model. 
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The main characteristics of the posterior distributions of the conditional 
correlation coefficients are presented in Figure 1, where the upper line rep- 
resents the posterior mean plus standard deviation the lower one - the poste- 
rior mean minus standard deviation. The conditional correlation coefficients 
produced by our VAR(1)-SV models with at least three latent processes vary 
markedly over time. Surprisingly, the TSV models with different ordering 
of the returns lead to different posterior inference on the conditional co- 
variances. The differences in the dynamics of conditional correlations are 
understandable because of the structure of the conditional covariance ma- 
trix. In the TSV models the conditional covariance between £1^ and £2,* 
(similarly between £i t and £3^) depends on the variance of £1^ (i.e. qn,t)- 
Thus, a large increase in the conditional variance of £i 5 t leads to an increase 
in the conditional covariance. Therefore the TSVwsf and TSVwfs models 
(in which the WIG index is the first component) lead to similar inference 
on the dynamics of the conditional correlations. The plots of the posterior 
means of pijt, obtained in the remaining TSV models are different (be- 
cause of differences in volatilities of the S&P500, FTSE indexes and WIG 
index). Note also that in the JSV model the latent processes that describe 
volatilities are included in the conditional correlation coefficient definitions. 
Consequently, the conditional correlations depend on the volatilities. Sur- 
prisingly, in the SDF model the conditional correlations are estimated very 
precisely - the posterior standard deviations of pijj are relatively small. The 
returns on the WIG index are lower correlated with returns on the S&P 500 
index (with an average of 0.18) than with returns on the FTSE index (with 
an average of 0.24). This low correlation is partially explained by the non - 
overlapping trading hours of U.S. market with the European markets. The 
U.S. market (represented by the S&P 500 index) has the average correla- 
tion of 0.47 with the U.K. market. Finally, it is important to stress that our 
results show that the conditional correlations are not significantly higher 
when world markets are down trending, which is in contrast to the results 
presented in the papers: (Jj, ^21 > E3 
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Fig. 1. Conditional correlation coefficients (posterior mean ±1 standard deviation) 



